A Comparison of Logistic Regression, k-Nearest Neighbor, and Decision Tree Induction for Campaign Management

نویسندگان

  • Martin Bichler
  • Christine Kiss
چکیده

Extensive research has been performed to develop appropriate machine learning techniques for different data mining problems. However, previous work has shown that no learner is generally better than another learner. Comparing machine learning methods depends very much on the characteristics of a particular data set and the requirements of the respective business domain. Direct marketing is an important task in marketing departments, and one where machine learning techniques have been used repetedly. A systematic comparison of classifier performance can achieve considerable gains in marketing effectiveness. This case study provides an assessment of the predictive performance of different classification methods for campaign management. The evaluation of data mining methods for marketing campaigns has special requirements. Whereas, typically the overall performance is an important selection criteria, for campaign management it is more important to select the technique which performs best on the first few quantiles. This study selects candidate techniques and relevant evaluation criteria for campaign management and provides a guideline for similar comparison studies.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Comparing Classification Methods for Campaign Management: A Comparison of Logistic Regression, k-Nearest Neighbour, and Decision Tree Induction

Extensive research has been performed to develop appropriate machine learning techniques for different data mining problems, and has led to a proliferation of different learning algorithms. However, previous work has shown that no learner is generally better than another learner. For repeated data mining tasks practitioners have started conducting empirical comparison studies to determine the b...

متن کامل

Using the SAS® System and SAS® Enterprise MinerTM for Data Mining: A study of Cancer Survival at Mayo Clinic

This paper evaluates and predicts a certain epidemiological (cancer survival) condition using data-mining techniques in SAS®. A data set that contains information about the survival of lung-cancer patients from a study at the Mayo Clinic was extracted from the R survival package. Data-mining techniques—namely linear and logistic regression models, regression and classification trees, and neares...

متن کامل

Detection of some Tree Species from Terrestrial Laser Scanner Point Cloud Data Using Support-vector Machine and Nearest Neighborhood Algorithms

acquisition field reference data using conventional methods due to limited and time-consuming data from a single tree in recent years, to generate reference data for forest studies using terrestrial laser scanner data, aerial laser scanner data, radar and Optics has become commonplace, and complete, accurate 3D data from a single tree or reference trees can be recorded. The detection and identi...

متن کامل

کاربرد الگوریتم‌های داده‌کاوی در تفکیک منابع رسوبی حوزۀ آبخیز نوده گناباد

Introduction: Reduction of sediment supply requires the implementation of soil conservation and sediment control programs in the form of watershed management plans. Sediment control programs require identifying the relative importance of sediment sources, their quantitative ascription and identification of critical areas within the watersheds. The sediment source ascription is involves two...

متن کامل

Comparison of Classification Methods Based on the Type of Attributes and Sample Size

In this paper, the efficacy of seven data classification methods; Decision Tree (DT), k-Nearest Neighbor (k-NN), Logistic Regression (LogR), Naïve Bayes (NB), C4.5, Support Vector Machine (SVM) and Linear Classifier (LC) with regard to the Area Under Curve (AUC) metric have been compared. The effects of parameters including size of the dataset, kind of the independent attributes, and the number...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2004